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STRINGS OF CONGRUENT PRIMES IN SHORT INTERVALS 

TRISTAN FREIBERG 



Abstract. Fix e > 0, and let pi = 2,p2 = 3, . . . be the sequence of all primes. We prove 
that if {q, a) = 1 then there are infinitely many pairs pr,Pr+i such that Pr = Pr+i = « mod q 
p^ • and Pr+i — Pr < elogPr- The proof combines the ideas of Shiu [9] and Goldston-Pintz- 

wp,. Yilduim [6]. 

< 

(N 

^ i 1. Introduction 

r^ • Fix any e > 0. In 2005, Goldston, Pintz and Yildirim proved [4, 6] that there are arbitrarily 

large x for which there are at least two primes in the interval (x,x + elogx], thus establishing 
the longstanding conjecture that there are infinitely many pairs of consecutive primes pr.,pr+i 

(S| '■ with Pr+l - Pr < e logPr- 

In [5] they extended their original argument to prove that there are arbitrarily large x for 



t^^ ■ which there are at least two primes in the interval {x,x + elogx] which are both in the 

• I arithmetic progression a mod q, provided {q,a) = 1. However one cannot deduce that these 

O I are consecutive primes for there might be a prime in-between them that is not = a mod q. 

Hence one can only deduce that either there are infinitely many pairs of consecutive primes 

Pr = Pr+i = a mod q with Pr+i — Pr < elogpr, or that there are infinitely many triples of 

^ ■ consecutive primes pr,Pr+i,Pr+2 with pr+2 — Pr < elogpr- Presumably both statements are 

c^ ' true but one can only deduce that one of them is true, and one does not know which one, 

from the result in [5]. 

In [9], Shiu proved an old conjecture of Chowla that there are infinitely many pairs of 
consecutive primes pr,Pr+i which are both = a mod q. Indeed he was even able to extend 
this to k consecutive primes. In this paper we will combine the methods of Goldston-Pintz- 
Yildirim and of Shiu to establish the following hybrid of those results: 

Theorem 1.1. Let g ^ 3 and a be integers with {q,a) = 1, and fix any e > 0. There 
exist infinitely many pairs of consecutive primes pr,Pr+i such that pr = Pr+i = a mod q and 

Pr+l -Pr < elogPr- 



2. Preliminaries 

In this section we will state two key technical propositions, to be proved in sections 4 and 
5. The first proposition requires some preparation. We begin by quoting the Landau-Page 
theorem, a proof of which can be found in [2, Chapter 14]. This theorem is used to handle 
problems arising from possible irregularities in the distribution of primes, hence in Bombieri- 
Vinogradov type theorems (see Lemma 4.2), caused by potential Siegel zeros. 

Lemma 2.1 (Landau-Page theorem). There exists a constant c such that the following holds 
for any Y > c. There is at most one integer go ^ Y , CLnd at most one real primitive character 
Xo mod go) such that 

L(l — S, Xo, Qo) = for some S ^ 



31ogF 

If qo exists, then q^ > (logF)^. We call Xo o't- exceptional character and go o.'it- exceptional 
modulus. 

Throughout, we fix a number e > 0, we let ff be a real parameter tending monotonically to 
infinity, and we set A^ := exp{H/e), that is H = elogN. If there is an exceptional modulus 
gp := qQ(H) ^ exp{H/e{log{H/e)Y) = A^V(iogiogJV)2^ ^^^ ^^ _ ^^^^^ ^^ .^^ greatest prime 

factor; otherwise let po = 1. 

For all sufficiently large H, either 

Pq = 1 OT Po is a prime with po > logH. (2-1) 

To see this, note that all real primitive characters are products of Legendre symbols with 
different odd primes, and possibly either the unique real character mod 4 or one of the two 
primitive real characters mod 8. Thus if go exists it is of the form 2°pi ■ ■ ■ p^, where a ^ 3 
and the p^'s are distinct odd primes. If this is the case and Pq ^ log-ff, then the prime 
number theorem implies go <^ exp((l + o(l))logif) ^ logA^, but Lemma 2.1 states that 
go> (logiV/(loglogiV)2)2. 

We let Q := Q{H) be a positive integer, upon which we will impose the following conditions: 

Q is composed only of primes p ^ H, (2-2) 

Q is divisible by all primes p ^ logiiT, (2.3) 

Q ^ exp [cH/ {log HY) for some constant c > 0, (2.4) 

a Po{H) 7^ 1 then po{H) does not divide Q. (2.5) 



We let 

n:={Qx + h,...,Qx + hk}, h,,...,hke[l,H]nZ, (2.6) 

denote a set of distinct linear forms, and we define 

Aj,(n;n,j):=\^ ^' MilogR/dy, (2.7) 

■^ ' d\P{n;n) 

where Yl denotes summation over indices coprime with Qpo, and 

P{n-n):=iQn + hi)---{Qn + hk). (2.8) 

Finally, we let 

{loer n if ra is prime, 
otherwise. 

Proposition 2.2. Given e > and sufficiently large H , let N and pq = po{H) be as 
defined earlier, and let Q = Q{H) he a positive integer satisfying (2.2) - (2.5). Fix positive 
integers k and i, and let "H = {Qx + /ii, . . . , Qx + hk} be a set of distinct linear forms with 
hi, . . . ,hk G [1, H]nZ and {Q, hi, ... , hk) = 1. Let h E [\,H]\~\X and suppose {Q, h) = 1, 
and let R = N^/'^~'^ for some e' E (0, 1/4). As H ^ oo, we have 

' — ^2A (log/2)'=+2^ 






and 



1 fHQ)\ V- 

N\ Q J ^ 



N<n-S,2N 
k 

^^ ^{Qn + h)An{n;n,k 

N<nii2N 



I J {k + 2f)\ 



(2.9) 



r^/2A(log^Rr!! ,fQx + h^H 
m)\t) {k + 2i)\ 'tQ^ + f^^^' 

2(£+l)\ (logi?)'=+2W 



(2.10) 



+ 1 J {k + 2i+l)\ 



tfQx + hen. 



Proposition 2.3. Let g ^ 3 and a be integers with {q,a) = 1, and for a given H, let 
Po = Po{H) be as defined earlier. There is an infinite sequence of integers Hi < H2 < . ■ ■ 
such that for any i, taking H = Hi, there exists a positive integer Q = Q{H), divisible by q 
and satisfying (2.2) - (2.5), such that 

\S\-\T\»,H(^^y (2.11) 



where 

S = S{H) := {h G (0, H] : (Q, h) = 1 and h = a mod q}, 

(2.12) 
T = r(/7) := {/i G (0, i/] : (Q, h) = 1 and h ^ a mod g}. 

T/ie implied constant in (2.11) depends at most on q. 



3. Proof of Theorem 1.1 

Fix integers q ^ 3 and a with (g, a) = 1. Recall that H = elogN, with e > fixed, and po is 
the greatest prime factor of the exceptional modulus go ^ N'^/i^^s^os^) ^ jf j^ exists, otherwise 
Po = 1. We choose H, Q = Q{H), S = S{H), and T = T{H) as in Proposition 2.3, so that 
Q is divisible by g and satisfies (2.2) - (2.5), and 

for some constant c(g) > 0, depending on g at most. 

We fix positive integers k, i (to be specified later), and we let "H = {Qx + hi, . . . , Qx + h/.} 
be a set of distinct linear forms such that, for each i, hi G [1, H]r\a mod g and {Q, hi) = 1. 
We let R = N^/^-^' with < e' < 1/4 (to be specified later), and we put 

^ ■ = 



\ ^ ^ N<n^2N \heS 



N\-0~) ^ .^W^ + /^)-}_^^(gn + /i)-log3giV|A^(n;?/,fc + 

^ ^ ' N<n^2N \heS heT 

We now show that if ^ > for a sequence of numbers A^, tending to infinity, then Theorem 
1.1 follows. 

Let 

^n '■= {p ^ {Qn, Qn + H] : p = a mod q} = {p '■ p = Qn + h,h E S} 
Bn '■= {v G {Qn, Qn + H] : p ^ a mod q} = {p '■ p = Qn + h,h E T}. 
If ^ > 0, then there is some n G (A^, 2A^] such that 

\Ar,\ log(Qn + H)^ Y^^iQn + h) > ^?9(Qn + h) + \og3QN ^ \B^\ logQn + log3QiV. 
has heT 

Now 

|A„| log (1 + H/Qn) ^ \A^\ H/Qn ^ H^QN < log(3/2) 
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if A^ is sufficiently large, and so 

log(3/2) + (lAJ - \Bn\)\ogQn > \og3QN 

and hence, as n ^ 2N, \An\ — \Bn\ > 1- But as these are integers, \An,\ ^ \Bn\ +2, and so, 
by the pigeonhole principle. An contains a pair of consecutive primes pr,Pr+i- These primes 
satisfy pr+i — Pr < H < elogQN < elogpr- 

Now, by our choice of "H, a straightforward application of Proposition 2.2 yields 
,'2i\{\ogR)''+^^ 



ij {k + 2i)\ 



f Q ^ , , 2(2£ + l) logR ^ ^ \-i n^ n\u '.haA 

Qx+h(^'H Qx+hen 

We have 

E 1 = ^' 5^ 1 = 1^1 -A;, 

Qx+hen Qx+h<^n 

logR = (1/4 - e') log A^, and log3QiV ~ logiV by (2.4), therefore 

/2A(iogflr!! 

f Q |S|-|r| , 2(2f+l) fc ^1 A 

We have written o(l) for kQ/{(j){Q) \ogN), because Q/(t){Q) <^ loglogQ ^ loglog A^. 

By choosing i = [vk] and k sufficiently large, the bracketed expression {■ ■ ■ } above is, by 
(3.1), 

^ c(g)e + 1 - 5e' - (1 + o(l)) = c(g)e - 5e' - o(l). 

By choosing e' = c(g)e/10 (we may assume that e is small enough so that e' < 1/4), we 
deduce that 

^>fcc(g)e(logA^)'=+2m (3 2) 

holds if A^ is sufficiently large. By Proposition 2.3, we may choose H, equivalently A^, from 
a sequence of numbers tending to infinity, and Theorem 1.1 follows. 



4. Proof of Proposition 2.2 

The estimates (2.9) and (2.10) of Proposition 2.2 are essentially the same as estimates already 
in the literature, so we will only outline a proof of each of them, referring to [3] and [5] for 
details. 

Let Q = Q{H) satisfy (2.2) and (2.3). For a set of distinct linear forms 7i, as in (2.6), and 
positive integers d, we define 

n{d) = n{d; H) := {n mod d : P{n; Ti) = mod 4, 

where P{n; T-L) is as in (2.8). A Chinese remainder theorem argument shows that n mod d G 
fi{d) if and only if p*" || P{n\'H) for every p^ \\ d, and so |f^((i)| defines a multiplicative 
function of d. Thus, if we define 

jj^MihgR/dy iid^R, 

^R{d;j) := < ■'■ (4.1) 

lo iid>R, 

we see from (2.7) that 

AR{n;n,j):=\_ J^' M{\og R/ dy = J^' A«(rf;j). (4.2) 

d\P(n;-H) nmodd 

We call l-L admissible if |f2(p)| < p for all p, and one can prove that this is equivalent to 
G{U) 7^ 0, where 

e,«,.nO-T)0-^)"' 

is the singular series for l-i. 

Lemma 4.1. Let H he a real number, let Q = Q{H) he a positive integer satisfying (2.2) 
and (2.3), and let l-L he as in (2.6), with k fixed. We have 

\n{p)\ = k for all p>H. (4.3) 

For k ^ logiJ, H is admissible if and only if {Q, hi- ■ ■ h^.) = 1. Moreover, as H ^ oo, for 
{Q, hi- ■ ■ hk) = 1 we have 

Proof. For primes p that do not divide Q, we have 

^(P) = {-hiQ^^, ■■-, -hkQ'^} mod p, 
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and hence 1 ^ |f^(p)| ^ niin(A;,p). For such p, we have |f^(p)| = k ii and only if the —hiQ ^ 
are all distinct modulo p, that is if and only ii p\ A, where 

A = A{n):= Y[ \h,-hj\. 

By (2.2), p > H implies p\ Q, and since 1 ^ \hi — hj\ ^ H for every i,j, p > H also implies 
p\ A, and hence |f2(p)| = k. We have established (4.3). 

If some prime p divides {Q, hi- ■ ■ hk), then P{n; H) = hi ■ ■ ■ hi^ = mod p for every n mod p, 
hence \^{p)\ = p, and so Ti is not admissible if {Q,hi- ■ ■ hk) 7^ 1. If {Q,hi---hk) = 1, 
then P{n]'H) = hi---hk ^ mod p, and hence |i^(p)| = 0, for every p dividing Q. For 
every other p we have 1 ^ l^^d?)! ^ min(/c,p). Then for k ^ log if and p \ Q, we have 
1 ^ |^(p)| ^ fc ^ logH < p hj (2.3), hence Ti is admissible. 

Now assume H is large enough so that logiJ ^ 2k, and suppose {Q, hi- ■ ■ hk) = 1. Then for 
(4.4), since |f^(p)| = if p | Q, it suffices to show that 



|^(^)|^^-^) ^1 (4.5) 



e'in) := n f 1 

as H tends to infinity. We break &{l-i) into two products according as p | A or p f A, and 
use the fact that |fi(p)| = k for p \ QA: 

p|A 

In this product p — A; 7^ because, by (2.3), p \ Q implies p > logH ^ 2k. For the same 
reason, the logarithm of the first product of the last line of (4.6) is 



^\\ p 2p'^ ) V P V )\ p^hP^ log iJ log log if 

For the second product, note that since k/logH ^ 1/2, we have 

k-\n(p)\ k 2k 

< — ^^ ^ — - ^ — < 1. 

p — k p — k p 

Hence the logarithm of the second product is 



E, / k-\Q(p)\\ v^ A; A; v^ A; log A 

log 1 + — ^^ < V - < - — - Y 1 < - — z— ^^ — - < 
V p-k J ^ P log^^ logiiloglogA 



, ,- - , ,, ,- -c- ,, log if log log A log log if 

p|A ^ ' p|A p|A 

p>\ogH p>\ogH 
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by the prime number theorem, because A ^ H^'^) . Exponentiating and letting H tend to 
infinity yields (4.5). D 

We now assume all of the hypotheses of Proposition 2.2. The proof of (2.9) is almost identical 
to the proof of Lemma 1 of [3], the only difference being that primes p \ Qpo are excluded 
from the representation of F{si, S2', ^), where 



F(si,S2;fi) := ^ n{di)n{di] 



mdud 



2 



di,d2 



[di,d2\dl^d 



\^ip)\ ( 1 , 1 



= TT 1 

in the region of absolute convergence. Since |f2(p)| = /c for j» > i/ by (4.3), we put 



G(si,S2;fi) := F(si,S2;fi) 



C(S1 + 1)C(S2 + 1) 



k 



C(si + S2 + 1) 

In the proof of Lemma 1 of [3], G(0, 0; fi) = ©("H), but in our situation, we have 

G,o,o;o)^n(-^)n(-i)"'^e(«,n(i-^)^ 

because ((5,Po) = 1 and |fi(p)| = if p | Q. The last product is ~ 1 by (2.1). Now applying 
(4.4), and proceeding as in the proof of Lemma 1 of [3], (2.9) is established. 

The proof of (2.10) follows that of Lemma 2 of [3] very closely: there is one important 
difference concerning the error 



E*(N,q) := max max 

x^N {a,q)=l 



^ logp 



X 



p=a mod q 



The usual Bombieri- Vinogradov theorem will not suffice here, but the next lemma, which is 
Lemma 2 of [5], will. 

Lemma 4.2. Let Q be an integer and Y, M be numbers such that 

Q^ ^Y ^M, exp hy^hgM) ^ Y. (4.7) 

// there is an exceptional modulus go ^ Y , suppose Po \ Q for some po \ qo; otherwise, let 
Po = l. If 

R* := M^^^Q-^exp (-^/\ogM) , (4.8) 



then we have, with explicitly calculable positive constants c\ and C2, 

calogM 



E ^•(M.Qfl)«c.|exp(- 

D<:R* ^ \ to 

{D,Qpo)=l 



(4.9) 



By (2.2) - (2.5), we see that (4.7) is satisfied with 

Y = exp {2cH/i\ogH)'') = Ar2-(i+o(i))/(iogiogiv)2^ 

and M = 3QN. We also have 

^2 ^ ^1/2-2.' ^ ^* = (3QAr)^/2g^3g^p (-v/log3QiV^ 

for all sufficiently large A^, and 

cslogM/logF = C2(l + o(l))logA^/logr = C2(l + o(l))(loglogiV)V2ce. 
Letting c^ = CijVlce and putting this into (4.9), we deduce from Lemma 4.2 that 

^E\3QN,QD) < iV(logAr)-^"^'°s^°s^ (4.10) 

for all sufficiently large A^. 

Now, abbreviating Aij((i; A; + £) to A^, by (4.2) we have 

Y, mn + h)KR{n-n,k + lf=Y^\d,\d, Yl HQn + h) 



N<nsi2N di,d2 JV<n^2JV 

[d^,d2]\P{n;H) 



(4.11) 



= Yl ^'^^^'^^ Yl Yl ^°sp- 

di,d2 mmod[di,d2] QN+h<p^2QN+h 

en{[di,d2]) p=h mod Q 

p=Qm-\-h mod [di,d2] 

We may assume {Qm + h, [di, ^2]) = {Q, [di, ^2]) = 1 in the last sum, so we define 

n*{d) := n{d) \ {m mod d : {Qm + /i, d) ^ 1}. 

For di,d2 with {Q,[di,d2]) = 1 and ?7i mod [^1,6/2] € Q*{[di,d2]), we let /im mod Q[(ii, c?2] 
be the unique congruence class mod Q[di,d2] satisfying km = h mod Q and km = Qm + 
h mod [di, ^2]- Thus, the last sum in (4.11) is equal to 

p=hm mod Q[di,d2] 

and (4.11) becomes 

QN 



0(Q) 
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r* + 0{S*), (4.12) 



with 

^. := E'^^^^tS^^TT^' ^* ■= Y^\>^<iM\\^*i[dud,])\E*{3QN,Q[d,,d,]). 

Now from the definition (4.1) it is clear that |Ad| ^ (logi?)'^"'"^. Also, as we saw in the 
beginning of the proof of Lemma 4.1, since (Q, hi ■ ■ -hk) = 1 we have |f2(p)| ^ k for all p, 
and so \^*{d)\ ^ \n{d)\ ^ A;'^^ for squarefree d. Thus 

S* ^ (log Kf^'+'^ Y^ fi\D)k^^''^E*{3QN, QD) ^ 1 

= (log i?) '('=+') Y^ ^i\D){?>kY^'''^E*{?>QN,QD). 

By the trivial inequality 

B.(3QA..QZ,)««^^«^. 

and the Cauchy-Schwarz inequality, we have 



Y^ fx\D){3k)''^^^E*{3QN, QD) 



For positive integers k, we have 

Ds£R2 d-d^^R'2 ^ '^ 



SO combining and applying (4.10) yields 

^* « ^^ ^ LAn-..,ww^. ^ iV(logiV)-3iogiog.v_ (4_i3) 



We will now evaluate T*, assuming first that Qx + h ^ H. Let 'H"'" = TiU {Qx + /i} and 
observe that for p f Q, 

\n*{p)\ = \n{p;n+)\-i:=\n+{p)\-i. 

As with |f^((i)|, a Chinese remainder theorem argument shows that 1^2* ((i)| defines a multi- 
plicative function of d. Thus 

\n*{[di,d,])\= n (ifi+(p)i-i), 

P\[di,d2] 
10 



provided [^1,(^2] is squarefree and (Q, [^1,^2]) = 1, as is the case for di,d2 appearing in the 
sum defining T*. 

We now proceed as in the proof of Lemma 2 of [3]: again, the only modification necessary is 
to G(0, 0; n+). First note that 



Y V p / vp-i/ V pJ 




By (4.3), |fi+(p) = in+l = k + lioTp> H, and if 




C(^ ^--)- n (1 l""(^)-ir 1 ' 


1 \\ /c(si+i)c(s2+i)y 

pSi+S2jJ y (^(^Si + S2 + l) J 


Li[Si, S2, \L .— i 1 „ 
J.J. \ p — 1 V p^i »^i 






Q 



pIQ p\po 

k 



by Lemma 4.1 and (2.1). Therefore 

r-^(^Vn"°^^)'"'. (4.14) 

U(Q)/' U/' (* + 2C)! * ' 

We remark that since {Q,h) = {Q,hi- ■ ■ hk) = 1, 'H'^ is admissible (for all sufficiently large 
A^) by Lemma 4.1, so we do not have to consider the other case as in the proof of Lemma 2 
in [3]. Combining (4.14) with (4.13) and (4.12) yields the first case of (2.10). For the case 
Qx + /i G "H, we observe that, similarly to (2.2) of [3], we have 

Y^ ^{Qn + h)KR{n-H,k + Cf = ^ ^{Qn + h)KR{n-H\{Qx + h},k + ^f , 

N<nf^2N N<n^2N 

SO the above evaluation applies with the translation k\-^k — l,i\-^i+l to (4.14). 



5. Proof of Proposition 2.3 

5.1. Auxiliary lemmas. To prove Proposition 2.3, we will use the following lemmas. 
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Lemma 5.1. Fix integers q and a with (g, a) = 1. There is a constant c{q, a) > 0, depending 
only on q and a, such that 



^ pj [logxy/'t'^i') 

p=a mod q 



as X — 7- OO. 



Proof. This follows from the prime number theorem for arithmetic progressions. For a more 
precise estimate, with the constant c{q,a) given explicitly, see [10, Theorem 1]. D 

Lemma 5.2. Let S^{x) denote the set of positive integers which are ^ x and composed only 
of primes p = 1 mod q. There is a constant c{q) > 0, depending only on q, such that 

I^WI=(cfe) + o(^))^(log.)'/*). 

Proof. See [9, Lemma 3], in which the constant c{q) is given explicitly. D 

The next lemma concerns \l/(x, y), the number of positive integers which are ^ x and free 
of prime factors > y (y-smooth numbers). The ratio \l/(x,y)/x depends essentially on n = 
logo;/ logy, and for u in a certain range is approximated by p(u), where p{u) is the Dickman- 
de Bruijn p-function, defined as the continuous solution to 

,1 ^M ^ 1, 



Lemma 5.3. The estimate 



*<^^"-^^)-pMfl + 0fM!i±^H (5.2) 



y"" V V logy 

holds uniformly in the range 

y^3, l^n^exp((logy)3/^-'^), (5.3) 

where 5 is any fixed positive number. The estimate 

p{u) = exp {—ulogu — u log log u + 0{u)) (5.4) 

holds for u > 3, and 

^(y",y) 



yU 



exp (— nlogn — nloglogn + 0(n)) (5.5) 
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holds uniformly in the range 

3<u^ y^'\ (5.6) 

Finally, as y -^ cxd, 

^{y,{\ogy)^) 1 



(5.7) 
holds for any fixed number A> 1. 



y j/lM+o(l) 



Proof We refer to the survey article of Granville [7]. The asymptotic (5.2) was shown to hold 
for the range (5.3) by Hildebrand [8]: see [7, (1.8), (1.10)]. Hildebrand [8] also estabhshed 
that the less precise estimate 

^^ = p(n)exp {Os (uexp {-{loguf'^'))) 

holds, for any fixed number 5 > 0, in the wider range (5.6). (See displayed formulas [7, (1.11), 
(1.13)].) That (5.5) holds in the same range can be deduced from (5.4). (The estimate (5.5) 
is less precise, but sufficient for our purposes.) For the estimate (5.7), see [7, (1.14)]. 

The value of the Dickman-de Bruijn p-function is discussed in [7, 3.7 - 3.9], and (5.4) was 
proved by de Bruijn in [1]. D 

Lemma 5.4. Let ^ be a subset of the primes. As y ^ oo, the estimate 



n(-i) E^a-a))e-£ 



p{v)dv. (5.8) 



p^y ^ ' n>y 

p&^ p\n=>p^y 

p6^ 



holds uniformly for u satisfying 

M ^ 1, M = exp {{\ogyf'^~^) , (5.9) 

where 5 is any fixed positive number. 



Proof. Define 



11 \ T) I ^—^ T) 



If £ ^ y is prime, then 



J.J. \ pi J. — i ji 

p^y n^x 

pG.^ p\n^p^y 

p££i^ 



:^)^ n f-i)-(-r E^ 



■^■^ \ V \ ^ / — n 

p£i^U{£} p\n^p^y 

pG^ 
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Now 

-1 



-V v--(i-- iv->v — 



n \ a. P I ^-^ n ^-^ m 

p\n=^p-^y p\n=^pi^y p|m=>p^j/ 

pe^ pe.^ p&.0^u{£} 

because every m appearing in the last sum may be written as ni°' for some a ^ and some 
n appearing in the second last sum. Hence, 

Qix,y;^)^g{x,y;^U{i}), 

and applying this inequality repeatedly, we obtain 

p^y ^ n^x 

p\n=^p^y 

Subtracting both sides from g{oo,y; ^) = 1 = g{oo,y; {p ^ y}), we deduce that 

U(^-i) E z^U(^-z) E I ("0) 



pi i i Ji J.J. \ pi C ^ j^ 

p^y ^ '■ ' n>x p^y ^ ■■ ' n>x 

p&,9> p\n^p^y p|?i=>p^y 

pe.^ 



By partial summation. 



^-^^ "^ Jx '^ ^ Jx '' Jx ^ 

p\n=^p^y 

Now we assume x = y", with u satisfying (5.9) and y tending to infinity. We will divide the 
range of the last integral in (5.11) into three parts. First of all, fix any e G (0, 1) and suppose 
t ^ exp(y'^), that is y ^ (logt)^/'^. By (5.7) we have 

*(t,y) ^ ^(t,(logt)V^) _ 1 



t2 ^ t2 t^+e+o{l) 

as t, and hence as y, tends to infinity. Thus, we may suppose y is large enough so that 
^{t,y)/t^ ^ l/t^+^Z^ say, and 

^"•^'^'^^r J.^-.:.^:^. (5.12) 



exp(r) ^^ iexpcr) ^^^'^^ eexp(e7/72)' 

For the range x ^ t ^ exp(y'^), the substitution t = y"" yields 

dt = \ogy / — rfw. 5.13 
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Next, we let ui = 2exp ((log 7/)^/^ ''), and for ui ^ v ^ y^, we use the estimate (5.5): 

:^ — = exp(-t;logt;-t;loglogi; + 0(i;)) ^ — , 

where the last inequality holds for all sufficiently large v, hence for all sufficiently large y. 
Thus 

y'"^^y^{y\y) ^ prf„ 1 



(5.15) 



dv<: - « — (5.14) 

for all sufficiently large y. 
For M ^ f ^ Ml, we use the estimate (5.2): 

/•OO /"OO 

= (l + o(l))/ p{v) dv - {I + o{l)) p{v)dv. 

J u J ui 

By (5.4) we have, similarly to (5.14), the estimate 

r , ., ^ r dv 1 

Ai Ai V'' m/ 

for all sufficiently large y. 

Combining (5.11) - (5.16), we see that 

y -A^dt={l + o{l))\ogyJ p{v)dv + 0{u^'''\ogy) (5.17) 

for all sufficiently large y. Now by definition (5.1), 

/■■u+l 

p{v) dv ^ / p(f ) c/f = (m + 1)p(m + 1), 



(5.16) 



and by (5.4), m^ "^ = o((m + 1)p(m + 1)) as Mi ^ 2m, and Mi tends to infinity with y. Therefore, 
combining (5.17) with (5.11) in fact gives 



1 /"°° 

J2 -^(l + o(l))logy / p{v)dv 



{h.li 



n>y 



as y — 7- cxD, for u in the range (5.9). Finally, combining (5.18) with (5.10) and applying 
Mertens' theorem, we obtain (5.8). D 



5.2. The proof of Proposition 2.3. We are now ready to define Q explicitly. The con- 
struction is modelled on that of Shiu's [9]. For the rest of this section we let g ^ 3 and a be 
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integers with (g, a) = 1. If a = 1 mod q, let 

^{H) := {p^hgH ■.p=l mod g} U {p ^ H/{\ogHf : p ^ I mod g}, 

otherwise let 

^[H) := {p ^ log-ff : p = 1 mod g} U {p ^ H/{\ogH)^ : p ^ l,a mod g} 

U {t{H) ^p^ H/{\ogHf : p = 1 mod g} U {p ^ H/t{H) : p = a mod g}, 

with 

/'log if log log log if 

t(ii) := exp 



2 log log ii 
and put 



g(ii):=g n P' g = g(ii):=g J] P" (5-19) 

We check that (2.2) - (2.5) are indeed satisfied by Q: only (2.4) is not immediate, but it 
follows from the prime number theorem. 

Analogously to (2.12), we define 

S{H) := {h G (0, H] : {Q{H), h) = I and h = a mod g}, 

(5.20) 
T{H) := {h e (0, H] : {Q{H), h) = 1 and h ^ a mod g}. 

Proposition 2.3 will follow from the next lemma. 



Lemma 5.5. Let H be a real parameter tending to infinity, and let Q{H) he as in (5.19). 
We have 

\nH)\'^T^- (5.21) 

log if 

Moreover, there is a constant A = A{q), depending on q at most, such that for all sufficiently 
large X , there is some H satisfying 

^ <:h <:x, (5.22) 



(logX)^ 
such that 

\S{H)\»,H^^^. (5.23) 

' Q{H) 

The implied constant in (5.21) is absolute, and that in (5.23) depends on q at most. 

16 



Proof of Proposition 2.3. Let S{H) and T{H) be as in (2.12). If po 7^ 1 then by (2.1) there 
are at most H/po < H/\ogH multiples of po in T{H), so 

by (5.21). We also have \S{H)\ ^ \S{H)\. An application of Lemma 5.1 reveals that 



QW .it„,V- Py""|^(gS)"'*' ifa^lmod,. 
Therefore, in either case, combining (5.21) and (5.23) gives 

\sm - \Tm » \sm - \fm », h^^^ » h^^^^. 

Proposition 2.3 now follows from Lemma 5.5. D 



Proof of Lemma 5.5. We assume a ^ 1 mod q as the case a = 1 mod q is similar and simpler. 

There are <^ H/logH primes in T{H), so let us count the composites h G T{H). If 
h = pm for some prime p > H/{logHY, with m > 1, then m < (logH)^ is com- 
posed only of primes > logH and = 1 mod g, by the construction of 0^{H). Thus, m 
must be prime itself, and p ^ H/\ogH. We partition {H / {log HY , H / log H] into sub- 
intervals Ii = {e'-~^ H/ {log HY,e^H/ {log H)\ and {logH, (log if )^] into sub-intervals Ji = 
{logH, (logiJ)^/e'], 1 ^ / ^ log log if, and using the prime number theorem, we deduce that 
the contribution from elements with a large prime factor is at most 

F r F i« r ''" ^'"'"^^ «-g- 

/-^ /-^ /^ /-^ (logiJ)3eMoglogif log if 

p^l mod qp=l mod g 

If /i = pm with p = a mod g, then p > H/t{H), and m < t(if) must be composed only of 
primes = 1 mod q, a contradiction as /i ^ a mod q. The only elements left uncounted must 
be composed only of primes p = 1 mod q with log if < p < t{H). By (5.5), the number of 
such elements is at most 

^(if, t(if)) = H exp {—ulogu — n log log n + 0{u)) , 

where 

log ii" 2 log log H 

u 



log t{H) log log log if 
Thus 

u log u + u log log u + 0{u) ~ u log n ~ 2 log log if, 
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and so 

^{H,t{H))<^ ^ 



logH 
Combining these estimates yields (5.21). 

Now suppose H is in the range (5.22). To bound the size of S{H) from below we will first 
do the same for 

S'{X) := {h G (0, X] : (Q'(X), h) = 1 and h = a mod g}, 

where 

Q\X):=q Y[ p, ^'(X) := ^(X)\{j9^1ogX:j9 = lmodg}. 

Now pm e S'{X) if X/t{X) < p = a mod g and m G y{X/p). We partition {X/t{X),X] 
into sub-intervals J/ = {e''~^X/t{X),e''X/t{X)], 1 ^ / ^ logt(X), and deduce, using the 
prime number theorem for arithmetic progressions and Lemma 5.2, that 



is'wi? E E E 



liiKlogt(X) pGli m.e.J^(i(X)/e') 
p=a mod q 

> V — ^-^ aoetfXll^/^^") (5.24) 

» -^(logt(X))V^W. 
logX 

Now, we may write any h G S'{X) uniquely as h = dm, where d is composed only of primes 
p ^ logX with p = 1 mod q, and m G S{X). Thus, by (5.24), there is a constant ci(g) > 0, 
depending on q at most, such that for all sufficiently large X, 

ci(g)j^(logt(X))V^W ^ |S'(X)| = J2 E 1^ E \SiX/d)\. (5.25) 

p=l mod (jr p=l mod q 

The inequality on the right is not immediate: in fact ii Z ^ X, then S{X) fl (0, Z] C S'(Z). 
To see this, first note that as all of the functions used to define ^{X) are monotonically 
increasing with X, 

^{Z) C ^(X) U {t{Z) ^p^ t{X) ■.p = l mod q}. 

Suppose m G S{X) fl (0, Z], but m ^ >5(Z). Then p G ,^(Z) for some p \ m, but p ^ ^{X), 
so t(Z) ^ p ^ ti^) ^iid p = 1 mod q. Since ?7i = a ^ 1 mod g, there must be some p' \ m 
with p' ^ 1 mod q and p' ^ ?7i/p ^ Z/t{Z) ^ X/t(X). Then p' G <f^(X), a contradiction. 

18 



Suppose for a contradiction that for some constant C2(g) > 0, depending on q at most, we 
have 

3c2 (g) log X \ log log X J 



for all H in the range (5.22). Then 

V</'{g) 



V \~sixid)\<''^''^ ^ / iogt(x) Y/^(^) ^ 1 



p|d=>p<logX p|d=>p^logX 

p=l mod q p=l mod g 



cM X ( \ogtiX) Y^^'> yr A_iy' (5.27) 

3c2(g)logX VloglogXy* AL \ pj 



p^logX 
p=l mod q 

3 log A 
provided X is sufficiently large, and for a suitable choice of C2{q) (given by Lemma 5.1). 

Now, by the fundamental lemma of Brun's sieve, we have 

\S{x/d)\<t:^ n (i--) (5.28) 

for any d. If (logX)"^ < d ^ vX, then log{X/d) x logX, and applying Lemma 5.1 to the 
sieve upper bound (5.28), we see that 



(5.29) 



(logX)'^<d!iVX \ O O / {logX)'^<d!iVX 

p|d=>p^logJf p|d=^p^logX 

p=l mod q P=l mod q 

for some constant c^^q) > 0. By lemmas 5.4 and 5.1 respectively, we have 

E ^^ n (i-^) \i+o{i))e~^ r p{v)dv 

(\ogX)'^<d^^/X P^logX ^ ^^ •'^ 

v\d^P^\ogX P=lmod5 /g_3QX 

p=:l mod (jr 

/•oo 

^C4(g)(loglogX)i/'^('?M p{v)dv 

J A 

for some constant C4(g) > 0. Now by (5.4), 

p{y) df — !■ as A — )■ oo, 
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so we may choose A = A{ci{q), Cs^q), C4(g)) = A{q) so that 

Ja 4c3(g)c4(g) 

For any such A, combining (5.29) and (5.30) yields 

J2 \SiX/d)\ ^ ^]-^(logt(X))V^(«). (5.31) 



(logX)^<d<VX 
p=l mod q 



Finally, using Rankin's trick, we see that 



Vx<d^x Vx<d^x ^v^/ p^iog^ V /^ / 

p\d^p^logX p\d^p^logX 

p=l mod 5 



^X^/^expl V ^77 I ^^^^^exp(9(logX)l/3^ 



(5.32) 



^ ^5/6+o(l) 

by the prime number theorem. 

Combining (5.25), (5.27), (5.31), and (5.32), we obtain ci(g) ^ 2ci(g)/3, which is absurd. 
We conclude that for all sufficiently large X, there is some H in the range (5.22) for which 

log A \loglogA/ logii \loglogii/ 

A final application of Lemma 5.1 shows that this is ^g H(f){Q{H))/Q{H). D 

6. A LOWER BOUND 

In this section we will show how to obtain a quantitative version of Theorem 1.1. We will 
use the assumptions and notation of sections 3-5, and show that 

\{Pr+i ^ Y : pr+i =pr = a mod q and Pr+i - Pr < e\ogPr}\ > ri/3{iogiogy)^ ^^^^^ 

for all sufficiently large Y . Here A = A{q) is the constant given in Lemma 5.5. This lower 
bound could be improved by a sharpening of the range (5.22) for H. 

We will ffist prove that the estimate 

^ A{n;n,k + iy^N{\ogNy^''+^' (6.2) 

N<nii2N 
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(6.3) 



holds, with an absolute implied constant. For by (4.1) and (4.2), 

N<n!^2N di,...,d4 N<n^2N 

[di,...,d4]\P{n;n) 

= 5^ ^di ■ ■ ■ ^d4 Yl Yl ^ 

di,...,d4 m mod [di,...,d4] N<nsi2N 

en([di,...,d4]) n=mmod [di,...,d4,] 

i E IV ••■A.J E (i4:^+«(^^ 

di,...,d4 m mod [di,...,Q!4] 

squarcfree en([di,...,d4]) 

«iv(iogi?)^(^+^) V '^ii^^'-'-'/t^^' . 

squarcfree 

To see the last inequality, note that [di, ...,d4\ ^ R^ = N'^~'^'^ = o{N), and so N/[di, ..., ^4] + 

0(l)<iV/[di,...,d4]. 



As observed in Section 4, |f2((i)| ^ A;'^('') for squarefree d, so 



^ \n{[d,,...,d,])\ ^ ^ /.^(D)H^) ^ 

Z^ W, ^j ^ Z^ n Z^ ^ 



di,...,d4^-R 
squarefree 



D^R-i 




[rfl,... 


....,di 






E 


ii\D){ibky 

D 

D4Nl5fc 


'^{D) 


<11 




Ihk 
P 



(6.4) 



Since R^ < N, combining (6.3) and (6.4) yields (6.2). 

Now choose A^ so that (3.2) holds. If we restrict the outer sum in the definition of Jf to 
those n for which {Qn, Qn + H] contains a prime string Pr+i = Pr = a mod q, we remove no 
positive terms. Thus, if ^* denotes this restricted sum, then 

^^ 

^ ^ ^ Af<n.s£2Af \heS heT / 

For each n G (A^,2A^], 

Y ^{Qn + h)-J2 ^{Qn + h)- log 3QN ^ H log 3QN, (6.6) 



hes heT 
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and by the Cauchy-Schwartz inequality, 

Y,* ARin-n.k + if^l 5^* Ij ( Yl Mn;'H,k + iA . (6.7) 



N< 

Combining (6.5) - (6.7) yields 



-1 

\4 



Y l>N\Q/<l){Q)f''^\H\og3QN)-^i J^ Mn;n,k + 

N<n<:2N \N<ni:2N 

Using H = elogN, \og3QN = (1 + o(l))logA^, and Q/4>{Q) ^ 1, then applying (3.2) and 
(6.2), we see that the right-hand side is '^k,q N/{\ogNy^''~^'^. Since k depends on e, we may 
write 

where Bie) is a constant depending on e. 

Now fix a large number F, and let 

Ice - ' 



X:=e\l+ logy, 

V (loglogF)V 

with c > fixed. By Lemma 5.5, we may choose H in the range 

X/(logX)^ ^H ^X 
so that (3.2), hence (6.1), holds with N = exp{H/e). By (2.4), 



because 



H cH H f ce \ X f 2ce , , ^^ 



e ilogHf e V i}ogHfJ e V (log log F)^ 
Here we have used \ogH = (1 + o(l)) logX = (1 + o(l)) log log F. Also, 

logA^ = H/e ^ X/e(logX)^ ^ logF/2(loglogy)^. 

Therefore, using (6.8) as a lower bound for the number of prime strings up to F, we deduce 
(6.1). (At best, we may have H = X, in which case we could deduce a lower bound of 
yi-c'/{iogiogy)2 fQ^ gQ^g constant c' > 0.) 



22 



7. Concluding remarks 

Proposition 2.2 is similar to a special case of Propositions 1 and 2 of [5], which are used to 
prove that 

hminf ^V^'"/ ^e""(v^-l)'. 

r-^co (p[q) iogp; 

where p', denotes the jth smallest prime in the arithmetic progression a mod q, {q,a) = 1. 
By considering H^ = {u — 1 + e) logN instead oi H, Q = Q{H^) instead of Q{H), and 

^ ^ ^ N<n^2N \heS heT J 

instead of =Sf , it is possible to prove that the interval {Qn, Qn + Hy\ contains a string of 
V + 1 consecutive primes = a mod g, for some n G (A^, 2N] and a sequence A^ — ?• oo. It may 
be feasible to prove a similar result with H^ = {e^"'{^^Jv — 1)^ + e) log TV. 
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